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METHOD AND SYSTEM FOR INTERNET PERSONALIZATION 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention: 

The present invention relates generally to a method and system for creating a 
personalized display for a user of an electronic network. More specifically, the present 
invention relates to a method and system for determining a user's interests from the content 
of electronic documents viewed by the user and providing recommended documents and 
1 0 recommendation packages to a user based upon the determined interests. 

2. Description of Related Art: 

The number of Internet users continues to increase at an explosive rate. The World 
Wide Web ("Web") has therefore now become a significant source of information, as well as 
1 5 products and services. 

As the numbers of Web users rise, Internet commerce, also referred to as "e- 
commerce" companies, and content providers are increasingly searching for strategies to 
target their information, products and services to those Web users. One technique that is 
currently being used to provide Web users with more relevant and timely information is 
20 "personalization." 

Personalization can include sending a user an e-mail message tailored to that user, or 
providing customized Web pages that display information selected by, or considered of 
interest to the user. Personal merchandising, in which a unique view of an online store, 
featuring offerings targeted by customer profile is displayed, is another effective 
25 personalization technique. Personalization facilitates the targeting of relevant data to a select 
audience and can be a critical factor in determining the financial success of a Web site. 

Internet companies wishing to create highly personalized sites are currently poorly 
served by both personalization technology vendors and customer relationship marketing 
product vendors. Each of these vendors offers only part of the overall solution. In addition, a 
30 significant investment of time and resources by the client is required to deploy these current 
solutions. 

Most prior art personalization and Web user behavior (also known as clickstream) 
analysis technologies maintain a record of select Web pages that are viewed by users. This 
record, known as the "Web log" records which users looked at which Web pages in the site. 
35 A typical Web log entry includes some form of user identifier, such as an IP address, a cookie 
ID or a session ID, as well as the Uniform Resource Locator ("URL") the user requested, e.g. 
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"index.html." Additional information such as the time the user requested the page or the 
page from which the user linked to the current Web page can also be stored in the Web log. 

Traditionally, such data has been collected in the file system of a Web server and 
analyzed using software, such as that sold by WebTrends and Andromedia.. These analyses 
5 produce charts displaying information such as the number of page requests per day or the 
most visited pages. No analysis is performed of the internal Web page structure or content. 
Rather, this software relies on simple aggregations and summarizations of page requests. 

The prior art personalization methods also rely on the use of Web logs. One 
technology used in prior art personalization methods is the trend analysis method known as 

10 collaborative filtering. Examples of collaborative filtering systems are those of Net 

Perceptions (used for Amazon.com' s book recommendations), Microsoft's Firefly, Personify, 
Inc., and HNC Software Inc.'s eHNC. 

One method of collaborative filtering is trend analysis. In trend analysis collaborative 
filtering, the pages requested by a user are noted, and other users that have made similar 

15 requests are identified. Additional Web pages that these other users have requested are then 
recommended to the user. For example, if User A bought books 1 and 2 from an on-line 
bookseller, a collaborative filtering system would find other users who had also bought books 
1 and 2. The collaborative filtering system locates 10 other users who on average also bought 
books 3 or 4. Based upon this information, books 3 and 4 would be recommended to User A. 

20 Another type of collaborative filtering asks the users to rank their interest in a 

document or product. The answers to the questions form a user profile. The documents or 
products viewed by other users with a similar user profile are then recommended to the user. 
Systems using this technique include Reel.com 5 s recommendation system. 

However, collaborative filtering is not an effective strategy for personalizing dynamic 

25 content. As an example, each auction of a Web-based auction site is new and therefore there 
is no logged history of previous users to which the collaborative filtering can be applied. In 
addition, collaborative filtering is not very effective for use with infrequently viewed pages or 
infrequently purchased products. 

Another technique used to personalize Internet content is to ask the users to rank their 

30 interests in a document. Recommendations are then made by finding documents similar in 

proximity and in content to those in which the user has indicated interest. These systems may 
use an artificial intelligence technique called incremental learning to update and improve the 
recommendations based on further user feedback. Systems using this technique include 
SiteHelper (Ngu and Wu, 1997), Syskill & Webert (Pazzani et al., 1996) Fab (Balabanovic, 

35 1997), Libra (Mooney, 1998) and Web Watcher (Armstrong et al., 1995). 
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Another technique that has been used to personalize Internet content is link analysis. 
Link analysis is used by such systems as the search engine Direct Hit and Amazon.com's 
Alexa®. The prior art link analysis systems are similar to the trend analysis collaborative 
filtering systems discussed previously. In the link analysis systems, however, the URL of a 
5 web page is used as the basis for determining user recommendations. 

Other prior art personalization methods use content analysis to derive inferences 
about a user's interests. One such content analysis system is distributed by the Vignette 
Corporation. In the content analysis method, pages on a client's Web site are tagged with 
descriptive keywords. These tags permit the content analysis system to track the Web page 

10 viewing history of each user of the Web site. A list of keywords associated with the user is 
then obtained by determining the most frequently occurring keywords from the user's history. 
The content analysis system searches for pages that have the same keywords for 
recommendation to the user. 

This prior art content analysis systems is subject to several disadvantages. First, 

1 5 tagging each page on the client's Web site requires human intervention. This process is time- 
consuming and subject to human error. The prior art content analysis systems can only offer 
recommendations from predefined categories. Furthermore, the prior art content analysis 
systems require a user to visit the client's Web site several times before sufficient data has 
been obtained to perform an analysis of the user's Web page viewing history. 

20 Other prior art content analysis systems automatically parse the current document and 

represent it as a bag of words. The systems then search for other similar documents and 
recommend the located documents to the user. Such systems include Letizia (Lieberman, 
1995) and Remembrance Agent (Rhodes, 1995). These content analysis systems base their 
recommendations only on the current document. The content of the documents in the user's 

25 viewing history are not used. 

Many Web sites offer configurable start pages for their users. Examples of 
configurable start pages include My Yahoo! and My Excite. To personalize a start page 
using the prior art method, the user fills in a form describing the user's interests. The user 
also selects areas of interest from predefined categories. The user's personalized start page is 

30 then configured to display recommendations such as Web pages and content-based 
information that match the selected categories. 

This prior art method, however, is not automated. Rather, the user's active 
participation is required to generate the personalized Web start page. Furthermore, pages on 
the client's Web site must be tagged to be available as a recommendation to the user. In 

35 addition, recommendations can only be offered from predefined categories. Thus, the prior 



3 



PATENT 

Attorney Docket No.: 358.02 



art personalized start pages may not provide relevant content to users who have eclectic 
interests or who are not aware of or motivated to actively create a personalized start page. 

Content Web sites are increasingly generating income by using advertising directed at 
users of the Web sites. In the prior art, advertising was targeted to users by using title 
5 keywords. In this method, keywords in the title of a Web page or otherwise specified by the 
author of the page are compared with the keywords specified for a particular advertisement. 
Another technique used is to associate specific ads with categories in a Web site. For 
example, advertisements for toys might be associated with Web site categories related to 
parenting. However, these prior art methods require human intervention to select the 
1 0 keywords or to determine the associations of advertisements with particular categories. 

Furthermore, the prior art methods cannot readily be used to target advertisements to dynamic 
content. 

It would therefore be an advantage to provide a method and system for providing 
Internet end users with relevant and timely information that is rapid to deploy, easy and 
15 inexpensive for client Web sites to use. It would be a further advantage if such method and 
system were available to automatically and dynamically determine the interests of a user and 
recommend relevant content to the user. It would be yet another advantage if such method 
and system were available to provide for a user a personalized recommendation package, 
such as an automatically generated start page for each user who visits a Web site. 

20 

SUMMARY OF THE INVENTION 



The present invention is a computer-implemented method and system for creating a 
personalized display for a user of an electronic network. The method can be used with any 

25 electronic network including the Internet and, more specifically, the World Wide Web. 

The present invention provides a user personalization service to businesses and 
organizations, preferably e-commerce and Internet businesses, that provide document servers. 
In the preferred embodiment of the invention, a tracking module is installed at a client 
document server. In the presently preferred embodiment, the client document server is a Web 

30 server. The tracking module determines whether each user who views a document on the 
client document server has a user-identifier ("user ID") installed on the user's computer. In 
the preferred embodiment of the invention, this user ID is in a cookie that is sent to the user's 
Web browser application. If no user ID is located, the tracking module sends a user ID to the 
user's computer. 

35 The tracking module logs every request made by every user for documents on the 

client document server. The tracking module sends the user ID and a document identifier 
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("document ID") to a marketing system in response to each user's request to view a 
document on the client document server. This information, including both document data 
and content data related to the user's activities and behavior while viewing the document on 
the client document server, is stored on a database associated with the marketing system. 
5 In the preferred embodiment, recommendation software operable on the marketing 

system is then used to analyze each document a user views in terms of the (a) content viewed 
by the user and (b) ancillary information. The recommendation software uses customizable 
filters that extract from the document only the content deemed to be relevant to a user. In 
addition to extracting the content of each page, the recommendation software uses filters to 
1 0 extract structure within this content. 

Given a set of documents that a user has viewed, the recommendation software is 
used to determine any common themes and concepts of the set. The recommendation 
software uses a document clustering process to group together those documents of the client 
document server that have been viewed by the user according to their common themes and 
;pl5 concepts. For each individual user, the recommendation software clusters those documents 
that have the most themes and concepts in common with one another into interest folders. 

The set of interest folders, in combination with any other structural information 
regarding the user's behavior, is used by the recommendation system to derive a user interest 
profile. The user interest profile comprises one or more interest folders. In the preferred 
J20 embodiment, each interest folder is automatically summarized in terms of the most relevant 

keywords from the associated collection of documents in the interest folder. The user's profile 
I can be summarized using these keywords to facilitate the automatic deriving of user's interest 
? and intentions in viewing content and advertisements. Filters, such as the removal of 
"stopwords", can be used to screen out common prepositions, articles, possessives and 
25 irrelevant verbs, adjectives and nouns. 

The recommendation software can also be used to categorize documents to 
automatically place new documents into a user's existing predefined interest folders. 
The present invention uses the user's interest profile (the product of the analysis of document 
content) to provide a model for automatically deriving a ranking of user's interests and 
30 intentions in viewing particular documents. This model can then be used to rank the new 
documents so that the most relevant documents to a user's interests folder are automatically 
placed in that interest folder. 

The model can also be used to determine "recommended documents" and 
"recommendation packages", which contain suggested products and/or documents that are 
35 tailored to a user's interests and to the product and/or document that a user is currently 
viewing. In the preferred embodiment of the invention, the recommendation package 
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is a personalized Web start page for the user. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Figure 1 is a flow diagram of the personalization method according to the present invention 
Figure 2 is a block diagram of a computer network system according to one embodiment of 
the present invention. 

Figure 3 is a diagram of the system for Internet personalization, according to the preferred 
embodiment of the invention. 
10 Figure 4 is a flow chart of the method for Internet personalization, according to the preferred 
embodiment of the invention 

Figure 5 is a flow chart illustrating the formation of interest folders, according to the present 
invention. 

Figure 6 is an example of a user profile generated by the recommendation software, 
1 5 according to the preferred embodiment of the present invention. 

Figure 7 is an example of a recommendation start page according to the preferred 
embodiment of the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is a computer-implemented method and system for creating a 
personalized display for a user of an electronic network. The method can be used with any 
5 electronic network including the Internet and, more specifically, the World Wide Web. The 
preferred embodiment of the present invention includes components for analyzing Web user 
behavior, for remote user tracking, and for interacting with the user. 

In the following description, for purposes of explanation, numerous specific details 
are set forth in order to provide a thorough understanding of the present invention. It will be 
10 evident, however, to one skilled in the art that the present invention may be practiced without 
the specific details. In other instances, well-known structures and devices are shown in block 
diagram form to facilitate explanation. The description of preferred embodiments is not 
intended to limit the scope of the claims appended hereto. 

15 Features of the Invention: 

The present invention provides a user personalization service to businesses and 
organizations that provide document servers. In the preferred embodiment, the invention is 
directed primarily to e-commerce and Internet businesses. The invention can be used to 

20 provide personalization and Web user behavior (referred to herein as "clickstream") analysis. 
This service enables e-commerce and Internet sites to deliver highly personalized and 
relevant information to each of their users. The invention can be used with, but is not limited 
to, content sites and e-commerce sites.. 

Figure 1 is a flow diagram of the personalization method according to the present 

25 invention. The invention uses the recommendation software to remotely collect and process 
end user behavior 100. Each user action is considered and analyzed in terms of the structural 
content of the document that is actually viewed by the user 105. The interests of the user are 
determined 1 10 and the user can thereby be provided with a list of recommended documents 
that are selected according to the analysis of the content of the documents viewed by the user 

30 1 15. In addition, the invention can also be used to generate a personalized recommendation 
package, such as, in the preferred embodiment, a personalized start page or a personalized 
product catalogue for each user. 

The present invention is advantageous because, by having more relevant information 
delivered to each end user, the client can draw users back to the client document server and 

35 can create a barrier to their switching to a competing document server. This can result in 

increased advertising revenue accruing to the client, and e-commerce clients can receive more 
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revenue from sales because each user will receive more relevant suggestions of products to 
buy and will return more regularly. 

The invention offers significant advantages to clients over the prior art personalization 
methods. For example, using the invention, a personalized recommendation package can be 
5 rapidly deployed, with minimal effect on the original client document server during 

deployment. The present invention avoids the requirement for clients to develop and invest 
in complex techniques for their own tracking and personalization and is therefore more 
economical than prior art personalization schemes. In addition, the present invention will 
enable clients to retain customers through improved one-to-one interaction as well as drive 
10 revenue from increased sales through cross-selling and up-selling of their products. 

Definitions: 

For purposes of this application, the present invention will be referred to as the 
"recommendation system". The use of the term recommendation system is in no way 

1 5 intended to limit the scope of the present invention as claimed herein. As described in further 
detail herein, the recommendation system can include any suitable and well-known hardware 
and software components, and in any well-known configuration to enable the implementation 
of the present invention. 

The present invention is also implemented using one or more software applications 

20 that are accessible to the recommendation system. For purposes of this application, these 
software applications will be called the "recommendation software." The use of the term 
recommendation software is in no way intended to limit the scope of the present invention as 
claimed herein. 

The personalization service according to the present invention is preferably provided 
25 by an entity, referred to for purposes of this application as the market analyst. The term 
"client," as used herein, refers to the operator of a document server. In the preferred 
embodiment of the present invention, the client is the operator/owner of a Web site. The term 
"user" refers herein to an individual or individuals who view a document served by the 
client's document server. 
30 The recommendation system can include the market analyst's computers and network 

system, as well as any software applications resident thereon or accessible thereto. For 
purposes of this application, these components will be collectively referred to as the 
"marketing system." The use of the term marketing system is in no way intended to limit the 
scope of the present invention as claimed herein. As described herein, the marketing system 
35 can include any suitable and well-known hardware and software components, and in any 
well-known configuration to enable the implementation of the present invention. In the 
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presently preferred embodiment, the marketing system is maintained separately from the 
client document server. However, in alternative embodiments, the hardware and software 
components necessary to provide the personalization service can be a part of the client 
document server. In these alternative embodiments, the hardware and software components 
5 can be operated by, for example, a client e-commerce or Internet business itself 

The client's computers and network system, as well as any software applications 
resident thereon or accessible thereto will be collectively referred to, for purposes of this 
application, as the "document server." The term "document" is used to represent the display 
viewed by a user. In a Web-based embodiment, the document is a Web page. In an e-mail 
10 embodiment, the document can be an e-mail message or listing of messages, such as an in- 
box. 

As used herein, the term "database" refers to a collection of information stored on one 
or more storage devices accessible to the recommendation system and recommendation 
software, as described previously. The use of the term database is in no way intended to limit 

15 the scope of the present invention as claimed herein. 

The database according to the present invention can include one or more separate, 
interrelated, distributed, networked, hierarchical, and relational databases. For example, in 
the presently preferred embodiment of the invention, the database comprises a document 
database and a user database. The database can be created and addressed using any well- 

20 known software applications such as the Oracle 8™ database. The database according to the 
present invention can be stored on any appropriate storage device, including but not limited 
to a hard drive, CD-ROM, DVD, magnetic tape, optical drive, programmable memory device, 
and Flash RAM. 

The term "content sites" refers to Internet sites that are primarily providers of content 
25 based information such as news articles. Examples of content Web sites include CNET, 

MSN Sidewalk, and Red Herring. These sites can generate income from advertising, as well 
as syndication or referral fees for content. A content site's income can therefore be greatly 
dependent upon the Web site's ability to retain users. 

E-commerce sites are Internet sites whose primary business is the sale of goods or 
30 services. E-commerce businesses derive revenue from the sale of goods on their Web sites. 
A significant factor in the success of an e-commerce Web site is the site's ability to attract 
and retain customers. 

Syndicated content, as used herein, refers to other publisher's content that can be 
integrated into a client's document server. 

35 

Hardware Implementation: 



9 



PATENT 

Attorney Docket No.: 358.02 



Any or all of the hardware configurations of the present invention can be 
implemented by one skilled in the art using well known hardware components. In the 
presently preferred embodiment, the present invention is implemented using a computer. 
Such computer can include but is not limited to a personal computer, network computer, 
5 network server computer, dumb terminal, local area network, wide area network, personal 
digital assistant, work station, minicomputer, and mainframe computer. The identification, 
search and/or comparison features of the present invention can be implemented as one or 
more software applications, software modules, firmware such as a programmable ROM or 
EEPROM, hardware such as an application-specific integrated circuit ("ASIC"), or any 

1 0 combination of the above. 

Figure 2 is a block diagram of a computer network system 200 according to one 
embodiment of the present invention. Any or all components of the recommendation system, 
the marketing system, the client document server, and the user's computer can be 
implemented using such a network system. In computer network system 200, at least one 

15 client document server computer 204 is connected to at least one user computer 202 and to at 
least one marketing system computer 212 through a network 210. The network interface 
between computers 202, 204, 212 can also include one or more routers, such as routers 206, 
208, 214 that serve to buffer and route the data transmitted between the computers. 

Network 210 may be the Internet, a Wide Area Network (WAN), a Local Area 

20 Network (LAN), or any combination thereof. In one embodiment of the present invention, 
the client document server computer 204 is a World-Wide Web ("Web") server that stores 
data in the form of ' Web pages' and transmits these pages as Hypertext Markup Language 
(HTML) files over the Internet network 210 to user computer 202. Similarly, the marketing 
system computer can also be a WWW server. Communication among computers 202, 204, 

25 212 can be implemented through Web-based communication. In some embodiments of the 
present invention, computers 202, 204, and 212 can also communicate by other means, 
including but not limited to e-mail. It should be noted that a network that implements 
embodiments of the present invention may include any number of computers and networks. 

30 Software Implementation: 

Any or all of the software applications of the present invention can be implemented 
by one skilled in the art using well known programming techniques and commercially- 
available or proprietary software applications. The preferred embodiment of the present 
invention is implemented using an Apache Web server and Web-based communication. 

35 However, one skilled in the art will recognize that many of the steps of the invention can be 
accomplished by alternative methods, such as by e-mail. 
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In the preferred embodiment of the invention, the operating system for the marketing 
system is Red Hat™ Linux®. However, any other suitable operating system can be used, 
including but not limited to Linux®, Microsoft Windows 98/95/NT, and Apple OS. 
The recommendation software can include but is not limited to a Web server 
5 application for designing and maintaining the market analyst's Web site, a database 

application for creating and addressing the database, software filters for screening the content 
of documents served by the client's document server, a text clustering application, a text 
categorization program, a presentation module, a spider and/or search engine for seeking 
relevant documents, an e-mail application for communication with users, a spread sheet 

10 application, and a business application for verifying orders, credit card numbers, and 
eligibility of customers. 

The recommendation software can include any combination of interrelated 
applications, separate applications, software modules, plug-in components, intelligent agents, 
cookies, JavaBeans™, and Java™ applets. (Java and all Java-based marks are trademarks or 

1 5 registered trademarks of Sun Microsystems, Inc. in the United States and other countries.) 
The software applications that comprise the recommendation software can be stored on any 
storage device accessible to the marketing system, including but not limited to a hard drive, 
CD-ROM, DVD, magnetic tape, optical drive, programmable memory device, and Flash 
RAM. It will be readily apparent to one of skill in the art that the software applications can 

20 be stored on the same or different storage devices. 

In the preferred embodiment of the invention, the clustering application is 
implemented using the C programming language. However, in alternative embodiments, the 
clustering application can be implemented using other well-known programming languages, 
including but not limited to C++, Pascal, Java, and Fortran. The clustering application is 

25 preferably stored on the marketing system, but can alternatively be stored on any component 
accessible to the marketing system. 

In the preferred embodiment of the invention, the presentation module is implemented 
using Perl scripts and SQL. However, in alternative embodiments, the presentation module 
can be implemented in any other suitable programming language. The presentation module is 

30 preferably stored on the marketing system, but can alternatively be stored on any component 
accessible to the marketing system. 

In the preferred embodiment of the invention, the tracking module that is associated 
with the client's document server is implemented using Perl scripts. However, in alternative 
embodiments, the tracking module can be implemented using other well-known 

35 programming languages and software applications including but not limited to TCL, Java™ 
servlet, and Microsoft Active Server Page ("ASP") applications. The tracking module is 
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preferably stored on the client's document server, but can alternatively be stored on any 
component accessible to the document server. 

In the preferred embodiment of the present invention, content analysis and the 
generation of the user profiles, recommendations, and recommendation packages are all 
5 performed by the marketing system and recommendation software. However, in alternative 
embodiments of the present invention, any or all of these functions can also be performed by 
the client document server. The client document server performs the functions of data 
collection, data transfer to the marketing system and presentation of the recommendations 
and recommendation packages to the user. 

1 0 In the preferred embodiment of the invention, the database is implemented using Data 

Konsult AB's MySQL. However, in alternative embodiments, the tracking module can be 
implemented using other software applications including but not limited to Postgres, and 
Oracle® and Informix® database applications. The database is preferably stored on the 
marketing system server, but can alternatively be stored on any component accessible to the 

15 marketing system. 

The recommendation software is preferably a separate application from the marketing 
system operating system. However, one skilled in the art will readily recognize that the 
present invention can also be fully integrated into the marketing system operating system. 

20 Description of the Embodiments: 

Figure 3 is a diagram of the system 300 for Internet personalization, according to the 
preferred embodiment of the invention. A tracking module 306 is installed at the client 
document server 304. In the presently preferred embodiment, a Web site manager embeds 
25 Hypertext Markup Language (" HTML") links to the marketing system in the client 

document server and, specifically, on the client document server's start page. While the 
tracking module is implemented as a Perl module embedded in Apache in the preferred 
embodiment, the tracking can alternatively be implemented in other ways, for example using 
hypertext links. 

30 At the client document server 304, the tracking module logs every request made by 

every user for documents and sends this information to the database 310 associated with the 
marketing system 308. In the preferred embodiment of the present invention, the database 
310 includes a document database module 312 for storing information relating to the 
document and contents of the document, and a user database module 3 14 for storing 

35 information relating to the user's document viewing behavior. 
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In the preferred embodiment, each user is sent a user-identifier ("user ID") 316 that is 
stored on the user's computer 302. The tracking module sends the user ID and a document 
identifier ("document ID") 318 to the marketing system 308 in response to each user's 
request to view a document on the client document server 304. The recommendation 
5 software 320 is then used to process this information to construct a profile for the user and to 
make recommendations based thereupon. In the preferred embodiment, the presentation 
module 322 is operable to configure a recommendation package for the user into any desired 
format or appearance. 

Figure 4 is a flow chart of the method for Internet personalization, according to the 

10 preferred embodiment of the invention. A tracking module is installed at a client document 
server. In the preferred embodiment of the present invention, the client document server is a 
Web site. However, in alternative embodiments, the present invention is implemented with a 
client e-mail or File Transfer Protocol ("ftp") system. 

In this preferred embodiment, when a user requests a document on the client 

1 5 document server 400, the tracking module searches for a user ID on the user's computer 405. 
If a user ID is not located, the tracking module creates a new entry in the database and sends 
a user ID to the user's computer 410. In the preferred embodiment, this involves sending a 
cookie to the user's Web browser. However, any other appropriate identifier can 
alternatively be used, such as an IP number. 

20 The tracking module installed at the client document server logs every request made 

by every user for documents and sends this information to the marketing system. Thus, when 
the user requests a different document in the client's document server, the tracking module 
logs this action by sending the user ID and a document identifier ("document ID") to the 
database 415. In the presently preferred embodiment, the document ID is the URL of the 

25 particular Web page. However, other document IDs such as a product number can also be 
used. 

In alternative embodiments of the present invention, the tracking module can send 
additional information, such as the time spent viewing a document and the price of items 
displayed on the document to the marketing system database. The subsequent actions on the 
30 client document server of any user who is entered in the marketing system database are 
similarly recorded in the marketing system database. 

In yet another embodiment of the present invention, the marketing system can act as a 
proxy server. In this embodiment, the tracking module could be installed at either the 
marketing system or the client document server, or at both. In this embodiment, the user 
35 requests documents from the marketing system. In response to such request, the marketing 
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system requests the appropriate documents from the client document server and provides 
them to the user. 

In the preferred embodiment, documents and meta-data about the documents are 
stored in the document database module of the database. The document database can include 
5 other information obtained from the client, such as the price or size of an item. The user 
database module can include information obtained from the user, for example, whether the 
user placed a bid on an item, the user's name and address, which documents were viewed by 
the user, whether the user purchased an item, user profile or the time the user spent viewing a 
particular document. Information obtained from text analysis, document clustering, or 
10 document categorization can also be stored in the user database module. 

As the user browses through the client's document server, the marketing system uses 
the recommendation software to process the user's behavior, analyze the content of the user's 
document views and construct a profile for the user 420. 

The recommendation software uses the information in the user database to make a 
1 5 determination of what interests the particular user. For example a user who browsed an 
auction Web site for antique Roman coins and baseball cards would be determined to have 
two interests. These interests are determined by an analysis of the actual content of each 
browsed document. 

The recommendation software uses any or all of the gathered information about the 

20 user to search through the content on the client's document server to find the local content 
considered most relevant to that particular user 425. In the preferred embodiment of the 
invention, the marketing system regularly retrieves the content for each document and/or 
product on the client document server, for example, once per hour. 

The recommendation software analyzes each document a user views in terms of the 

25 (a) content and (b) ancillary information related to a user's viewing a document. The present 
invention uses this analysis of document content to provide a model for automatically 
deriving reasonable inferences regarding a user's interests and intentions in viewing 
particular documents. This model can then be used to generate a list of additional documents 
on the client document server, or elsewhere such as on another document server, that might 

30 be of interest to the user. These "recommendation documents" and "recommendation 

packages" provide a suggested product and/or document that is tailored to a user's interests 
and to the product and/or document that a user is currently viewing. 

The marketing system sends the recommended document(s), or a link to the 
recommended document(s) back to the client's document server 430. The recommendations 

35 can include but are not limited to URLs, product numbers, advertisements, products, 

animations, graphic displays, sound files, and applets that are selected, based on the user 
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profile, to be interesting and relevant to the user. For example, the most relevant ad for any 
page can be rapidly determined by comparing the current user profile with the description of 
the available advertisements. 

The user recommendations can be provided as a part of a personalized 
5 recommendation package. In the preferred embodiment of the invention, the 

recommendation package is a personalized Web start page for the user. For an e-mail server- 
based embodiment, the recommendation package can be personalized e-mail. The 
recommendation package gives each end user a unique view of the client document server by 
showing information that is relevant to that user. 

10 In the preferred embodiment, the document displayed to the user by the client 

document server includes a hypertext link that is used to access the personalized Web start 
page. When the user clicks on the hypertext link, the personalized start page is dynamically 
generated by the recommendation software at the marketing system. Each user will see a 
different view of the Web site based on the user's personal likes or dislikes, as determined 

1 5 automatically by the user's previous browsing behavior. Such automatic personalization 

minimizes the need for the client to specifically control document server content and permits 
the client to transparently provide information regarding the user's interests. 

When the user clicks on a link to this personalized Web page on the client's document 
server, the personalized page is served to the user from the marketing system. Although the 

20 page is served from the marketing system, the presentation module is operable to configure 
the personalized page to conform to the client's own branding and image, thereby 
maintaining the look and feel of the client's site. In addition, the Uniform Resource Locator 
(" URL") link, which is the "Web address" of the personalized page is configured to appear 
to be a link to the client document server. 

25 In alternative embodiments of the present invention, the personalized Web page does 

not have to maintain the look and feel of the client's document server, but can have any 
desired appearance. In such embodiments, the presentation module is operable to configure 
the recommendation package into any desired format or appearance. Furthermore, there is no 
requirement that URL link provided to the user appear to link the Web page to any particular 

30 Web site. In one embodiment of the present invention, the user can switch back at any time 
to the from the personalized recommendation package, such as the personalized Web start 
page, to a non-personalized document, such as the generic start page of displayed by the 
client document server. 

In another embodiment of the invention, portions of the client's document server can 

35 be mirrored on the marketing system. The recommendation software can then search through 
the mirrored client document server for content relevant to the particular user. The 
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recommendation software can also optionally include syndicated content from the marketing 
system or from the client's syndication providers in the personalized page. New standards 
based on XML such as Information Content Exchange ("ICE") will facilitate the 
incorporation of syndication into Web sites. 

The recommendation software according to the present invention uses information 
regarding the client's document server structure in the personalization analysis. For example, 
if a user typically looks at books in a particular category of a bookseller's Web site, this 
information will be used by the recommendation software, in addition to any content 
information, to create a personalized view of the site for the user. 

Figure 5 is a flow chart illustrating the formation of interest folders, according to the 
present invention. _The recommendation software thereby extracts and organizes the interests 
and document viewing habits of the user. 

In the preferred embodiment of the invention, the recommendation software uses a 
statistical process referred to herein as document clustering to group together those 
documents of the client document server that have been viewed by the user according to their 
common themes and concepts. For each individual user, the recommendation software 
clusters those documents that have the most themes and concepts in common with one 
another into interest folders 505. In the preferred embodiment, the recommendation software 
continually monitors each user and continually updates the user's interest folders and profile. 

The set of interest folders for each user can also be used to target advertisements to 
each user rather than, or in addition to content. In the presently preferred embodiment, each 
advertisement has an associated simple description. This description is specified by the 
creator of the ad. The description can be associated with the advertisement by methods 
including embedding in meta-language tags or in XML. 

Document clustering according to the present invention includes the automatic 
organization of documents into the most intrinsically similar groups or segments. As an 
example of the application of using document clustering, a user who enters the search term 
"Venus" into a search engine will likely receive documents about (a) Venus the planet; and 
(b) Venus the goddess. In the preferred embodiment of the present invention, the search 
results would therefore be clustered accordingly into two separate interest folders. None of 
the concepts in groups (a) and (b) are predefined but are formed as a result of the intrinsic 
similarity of the documents in each cluster. As a result, the clustering framework is very 
flexible for automatic organization of documents into groups. 

In the preferred embodiment of the present invention, the recommendation software 
uses a proprietary clustering algorithm to form the user interest folders. The clustering 
algorithm uses the textual content of the documents viewed by a user, in combination with 
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structural information about the document server, and ancillary information about the user to 
determine the interest folders for a user. 

In an alternative embodiment, a clustering algorithm is also used to segment large 
numbers of users into different user folders. However, one skilled in the art would readily 
5 recognize that any other suitable clustering algorithm could also be used in alternative 
embodiments of the invention. 

One significant feature of the clustering algorithm used by the invention is that the 
output of the algorithm can be readily viewed and understood. Each document cluster 
(interest folder) is described by the most relevant keywords of the documents within the 

1 0 document cluster 510. This feature enables both users and marketers to understand and 
control the degree of personalization and targeting that is made. 

The recommendation software can also be used to categorize documents 515. 
Document categorization is the automatic placement of new documents into existing 
predefined categories. Document categorization is used in the preferred embodiment of the 

1 5 present invention to select, from a database, documents that match a user's interest folders. A 
document categorizer can learn how to place new documents into the correct categories so 
that, for example, a new Web page or product can be automatically placed into the correct 
user interest folder. As an example, given a user interest folder containing documents about 
Roman coins, a document categorizer could select the most relevant products for that user 

20 from a particular Web site. 

Because Web pages are diverse in structure and form, the recommendation software 
uses customizable filters that extract only the content deemed to be relevant to users. In 
addition to extracting the content of each page, the recommendation software uses filters to 
extract structure within this content. The present invention can also use adaptive filtering 

25 algorithms that analyze a Web site and review different filter known structures to 
automatically find an appropriate filter for a particular Web site. 

For example, an on-line bookseller's Web page can display information regarding a 
book that is available for purchase. The Web page can include such structure as: book price, 
author, description, and reviews. The fields of the document database are preferably 

30 customized to the bookseller's Web page such that the names of each of these fields can 

automatically be stored therein. The fields of the user database are similarly configured for 
automatic storage of information obtained from the user. This information is then included in 
the recommendation software's analysis. 

In the preferred embodiment of the invention, the recommendation software uses 

35 proprietary filters that are specific for each Web site. For example, each of two music 
distribution Web sites would have its own specific customized filter. Alternatively, the 
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recommendation software can use filters that are specific for different types of Web sites. As 
an example, the recommendation software can have separate specific filters for such sites as 
auction Web sites, bookseller Web sites, and music Web sites. One skilled in the art would 
recognize that the recommendation software can also use any suitable commercially available 
5 filters. 

In the preferred embodiment, each interest folder is automatically summarized in 
terms of the most relevant keywords from the associated collection of pages in the folder. 
Keywords can be determined, for example, by using an information theoretic measure such as 
"Minimum Message Length" ("MML")to determine the most relevant words to define a 

10 user's interest folder. Filters, such as the removal of "stopwords," can be used to screen out 
common prepositions, articles, possessives, and irrelevant nouns, adjectives, etc. 

The keywords for a user's interest folders can be determined in any appropriate 
manner. In one embodiment of the invention, the message length of sending each word using 
the population frequency of the word is determined. This message length is referred to herein 

15 as the population message length of the word. The message length of sending each word 
using the interest folder's frequency of the word is then determined. This message length is 
termed herein the interest folder message length of the word. For each keyword, the interest 
folder message length of that keyword is then subtracted from the population message length 
of the word. The keywords for the user's interest folders are defined to be the words in which 

20 this distance is the greatest. 

Figure 6 is an example of a user profile 600 generated by the recommendation 
software, according to the preferred embodiment of the present invention. The profile shown 
in the personalized Web page of Figure 6 comprises two different interest folders 602, 604 
for a user of an on-line auction Web site. Each interest folder contains pages which are 

25 intrinsically similar to one another and dissimilar to pages in other interest folders. 

A specific interest folder contains a set of links 610 to auctions the user has viewed 
that are related to the theme of the interest folder. An interest folder can also include 
additional information including but not limited to information regarding the history of the 
user's Internet viewing, recommendations for the user, a summary of the user's purchases. In 

30 the example illustrated in Figure 6, each interest folder also has an associated set of keywords 
612 that summarize the most important concepts of the particular interest folder, as 
determined by the recommendation software. 

In the preferred embodiment of the present invention, the user can display and edit the 
user profile of Figure 6. For example, if the user is no longer interested in Roman antiquities, 

35 this interest folder 612 can be deleted from the user profile. 
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It is common for a user to regularly return to particular Web sites to look for specific 
information having a similar theme. For example, a user of an on-line auction Web site who 
collects Roman coins might frequently return to the antiquities section of the auction Web 
site. The present invention uses the profile of each user to automatically find other relevant 
5 pages in the Web site to recommend to the user. In the previous example, the 

recommendation software would search through all of the auctions currently running on the 
on-line auction Web site to search for those that match most closely with each of the user's 
interest folders. 

The present invention uses a sophisticated search engine that can incorporate any or 
10 all of the content and ancillary information in the user profile. Figure 7 is an example of a 
recommendation start page 700 according to the preferred embodiment of the present 
invention. The user's interest folders 602, 604 are displayed on the recommendation 
document. Each interest folder includes links to documents 610 that the recommendation 
software has selected based upon the user's profile. In the previous example of the Roman 
1 5 coin collector, the folder relating to this interest 604 includes links to auctions for Roman and 
other ancient coins. 

In the preferred embodiment of the present invention, a user can view and manage the 
user's profile. Thus, in the previous example, the user may wish to remove certain sections of 
the profile in order to stop receiving recommendations about Roman coin auctions. The 
20 recommendation software user interface allows users to delete interest folders, add extra 
keywords to an interest folder, or create their own interest folder from pages on a client 
document server. 

Because the user profiles are based primarily on keywords, the present invention 
can be used to not only target a user with content from the same Web site that the user is 
25 currently browsing, but also with content from other Web sites. For example, a user with an 
interest in collecting Roman coins could be automatically targeted with content from on-line 
publications related to antiquities. 

While the present invention is designed to automatically match users with relevant 
content, it is recognized that a client might wish to customize the manner in which users 
30 receive special promotions, event announcements and special news items. In the example of 
the Roman coin collector, a marketer of cruises might wish to target the collector with a 
promotion for a cruise of the Mediterranean. 

To enable marketers to interact easily with their users, the present invention provides 
the functionality to allow a marketer to search through the users' profiles using keywords in a 
35 standard search paradigm. Groups of users can be selected and then matched with relevant 
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content either by hand or automatically using the present invention's content matching 
technology. 

While the invention is described in conjunction with the preferred embodiments, this 
description is not intended in any way as a limitation to the scope of the invention. 
5 Modifications, changes, and variations which are apparent to those skilled in the art can be 
made in the arrangement, operation and details of construction of the invention disclosed 
herein without departing from the spirit and scope of the invention. 

One skilled in the art will readily recognize that, in an embodiment that features Web- 
based interaction between the user, the market analyst, and the marketer, there are many 
1 0 different ways in which communication can be implemented through the Web page graphical 
user interface. For example, this communication can be implemented using elements 
including but not limited to a dialog box, check box, combo box, command button, list box, 
group box, slider bar, text box. 

In the preferred embodiment of the present invention, all client's and users use 
1 5 computer-implemented methods to interact with the market analyst, for example, using a 

Web page or e-mail. However, in alternative embodiments, one or more such customers can 
communicate with the market analyst using other methods of communication, including but 
not limited to telephone, fax, and mail. For example, in one embodiment, a user can request 
modifications to the user's profile by making a telephone call to a client or to the market 
20 analyst. 
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What is claimed is: 

1 . A computer-implemented method for creating a personalized display for a user, 
comprising the steps of: 

creating a database entry for each user of a client document server using a tracking 
5 module; 

tracking requests by the user for at least one document on a client document server 
using the tracking module; 

storing, in a database, information regarding each document tracked by the tracking 
module; and 

10 analyzing the stored information to construct a profile of the user. 

2. The method of Claim 1, further comprising the steps of: 

determining based on the user profile, using the at least one recommendation software 
application, the interests of the user; and 
15 providing the user with recommended information, using the at least one 

recommendation software application, based upon the determined interests of the user. 

3. The method of Claim 2, wherein the recommended information is selected from the group 
consisting of recommendation documents and recommendation packages. 

20 

4. The method of Claim 2, wherein the recommended information is selected from the group 
consisting of advertisements, products, animations, graphic displays, sound files, applets, and 
services. 

25 5. The method of Claim 1, wherein the tracking step comprises the steps of: 

sending a user-identifier to the user's computer using the tracking module; and 
sending the user-identifier and a document identifier to the database when the user 
moves to a different document in the client's document server using the tracking module. 

30 6. The method of Claim 1, wherein the stored information comprises content information. 

7. The method of Claim 1, wherein the client document server is a Web site. 

8. The method of Claim 1, wherein the tracking module comprises embedded HTML links to 
35 the marketing system. 
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9. The method of Claim 1, wherein the analyzing step comprises the steps of: 

parsing, using the at least one recommendation software application, a set comprising 

information regarding at least one document viewed by the user; 

determining, using the at least one recommendation software application, at least one 
5 common theme or concept of the set; and 

clustering into interest folders, using the at least one recommendation software 

application, the at least one document in the set according to the number of common themes 

or concepts in the at least one document. 

10 10. The method of Claim 9, further comprising the step of categorizing at least one 

document, using the at least one recommendation software application, according to the 
common themes or concepts of the interest folders. 

11. The method of Claim 10, further comprising the step of recommending to the user, using 
1 5 the at least one recommendation software application, the at least one categorized document. 

12. The method of Claim 1, wherein the analyzing step further comprises the step of using at 
least a first filter to extract selected content from each document tracked by the tracking 
module. 

20 

13. The method of Claim 12, further comprising the step of using at least a second filter to 
extract structure information from within the extracted content. 

14. The method of Claim 9, further comprising the step of summarizing each interest folder, 
25 using the at least one recommendation software application, to determine at least one 

keyword from the at least one document in the interest folder. 

15. A method of automating market analysis on an electronic network, comprising the steps 

of: 

30 a user viewing, using a computer, a document from a client document server, the 

document containing internal content information; 

the client document server transmitting, using a computer, the internal content 
information of the viewed document to at least one recommendation software application; 
and 
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the user receiving, on the computer, a list of links to at least one documents 
recommended by the at least one recommendation software application responsive to the 
transmitted internal content information. 

5 1 6. The method of Claim 1 5, further comprising the step of storing, using a computer, the 
internal content information in a database. 

17. The method of Claim 16, further comprising the step of analyzing, using a computer, the 
internal content information in the database to determine a profile of the user, wherein the 

1 0 profile is used to recommend the list of links. 

18. A market analysis method comprising the steps of: 

receiving, on a computer, internal content information of at least one document 
accessed by a user; 

1 5 determining, using a computer, the theme and concepts of the at least one document; 

grouping, using a computer, the at least one documents into at least one interest folder 
according to common themes and concepts: and 

summarizing, using a computer, the at least one interest folder to extract at least one 
keyword representing the common themes and concepts of the interest folder. 

20 

19. The method of Claim 18, further comprising the step of deriving, using a computer, an 
interest profile from the user's at least one interest folder. 

20. The method of Claim 19, further comprising the step of modifying, using a computer, the 
25 user's profile in accordance with instructions provided by the user. 

21. The method of Claim 19, further comprising the step of utilizing the interest profile, 
using a computer, to locate documents for recommendation to the user. 

30 22. The method of Claim 19, further comprising the step of utilizing the interest profile, 
using a computer, to generate a personalized recommendation package for the user. 

23. The method of Claim 22, wherein the personalized recommendation package is a 
personalized Web start page. 

35 
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24. The method of Claim 23, further comprising the step of configuring the personalized 
Web start page to maintain the look and feel of the client's document server. 

25. The method of Claim 24, comprising the step of configuring a Uniform Resource 

5 Locator associated with the personalized Web start page in the form of a link to the client 
document server, 

26. The method of Claim 19, further comprising the step of providing, using a computer, 
syndicated information to the user in response to the user's profile. 

10 

27. A method of using a computer to customize information provided to a user over an 
electronic network, comprising the steps of: 

(a) tracking requests by a user of a client document server for at least a first 
requested document on the client document server; 
1 5 (b) extracting filtered content from the at least first requested document; 

(c) analyzing the filtered content of the at least first requested document; 

(d) constructing a profile of the user from the analyzed content; 

(e) determining, based on the user profile, the interests of the user; and 

(f) providing the user with recommended information based upon the determined 
20 interests of the user. 

28 . The method of Claim 27, further comprising the steps of: 

determining the theme or concept of the at least first requested document; 
clustering the at least first requested document into at least one interest folder 
25 according to the theme or concept in the document; 

categorizing at least a second document, according to the theme or concept of the at 
least one interest folder; and 

recommending to the user the at least second categorized document. 

30 29. The method of Claim 28, further comprising the steps of: 

summarizing the at least one interest folder to extract at least one keyword 

representing the theme or concept of the interest folder; and 

searching the at least second document for the at least one keyword to categorize the 

at least second document. 



35 



30. A system for creating a personalized display for a user, comprising: 
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means for tracking requests by a user of a client document server for at least one 
document on the client document server; 

means for extracting filtered content from the requested document; 

means for analyzing the filtered content of the at least one requested document; 
5 means for constructing a profile of the user from the analyzed content; 

means for determining, based on the user profile, the interests of the user; and 

means for providing the user with recommended information based upon the 
determined interests of the user. 

10 3 L The method of Claim 30, further comprising the steps of: 

means for determining the theme or concept of at least a first requested document; 

means for clustering the at least first requested document into at least one interest 
folder according to the theme or concept in the document; 

means for categorizing at least a second document, according to the theme or concept 
1 5 of the at least one interest folder; and 

means for recommending to the user the at least second categorized document. 

32. The method of Claim 3 1 , further comprising the steps of: 

means for summarizing the at least one interest folder to extract at least one keyword 
20 representing the theme or concept of the interest folder; and 

means for searching the at least second document for the at least one keyword to 
categorize the at least second document. 

33. A recommendation system, comprising: 

25 a marketing system comprising at least a first computer with at least one 

recommendation software application operable thereon; 

at least a second computer with at least one client document server operable thereon; 
at least a third computer operable to receive and display to a user at least one 
document from the client document server; 
30 a tracking module associated with the at least one client document server to track the 

display of the at least one document to the user and to provide information regarding the at 
least one document to the marketing system; 

at least one database accessible to the marketing system to store information 
associated with the at least one document displayed by the user; 
35 wherein the recommendation system is operable to analyze the information stored in 

the database to determine a profile of the user; 
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wherein the recommendation system is operable to recommend documents to the user 
in response to the determined profile; and 

wherein the recommendation system is operable to provide a personalized 
recommendation package to the user in response to the determined profile. 

5 

34. A system for providing personalized information to a user, comprising: 

a memory device having embodied therein information relating to at least one 
electronic document viewed by the user; 

a processor in communication with the memory device, the processor configured to: 
10 extract filtered content from the at least first viewed document; 

analyze the filtered content of the at least first viewed document; 

determine the theme or concept of the at least first viewed document; 

cluster the at least first requested document into at least one interest folder according 
to the theme or concept in the document; 
1 5 construct a profile of the user from the analyzed content; 

determine the interests of the user based on the user profile; 

categorize at least second document, according to the theme or concept of the at least 
one interest folder; and 

recommend to the user the at least second categorized document. 

20 

35. A system for promoting a business through an electronic network, comprising: 

a memory device having embodied therein information relating to at least one promotional 
item; 

a processor in communication with the memory device, the processor configured to: 
25 receive a listing of the at least one promotional item; 

provide the listing to at least a first customer; 

receive from the first customer a selection of at least one promotional item from the 

listing; 

transmit the selection to a provider for providing the selected promotional item to the 
30 first customer; 

provide the first customer with an identification key; and 

provide the first customer with an incentive award for each additional customer who 
uses the identification key in association with a selection of at least one promotional item 
from the listing. 
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ABSTRACT 

A computer-implemented method and system is provided for creating a personalized display 
for a user of an electronic network. A tracking module installed at a client document server 
logs user document requests on the client document server. The tracking module sends user 
5 and document identification and content information to a marketing system in response to a 
user's request to view a document on the client document server. Recommendation software 
operable on the marketing system analyzes each viewed document in terms of the content and 
ancillary information viewed by the user. Customizable filters extract from the document 
content deemed relevant to a user and also extract structure from within this content. A 

10 document clustering process is used to group together viewed documents into interest folders 
according to common themes and concepts. Each interest folder is preferably automatically 
summarized in terms of the most relevant keywords from the associated collection of pages in 
the folder. New documents are categorized and automatically placed into existing predefined 
interest folders. The set of interest folders, in combination with any other structural 

1 5 information regarding the user's behavior, is used by the recommendation software to derive 
a user interest profile. A model is thereby provided for automatically deriving reasonable 
inferences regarding a user's interests and intentions in viewing particular documents. In 
accordance therewith, recommendation documents and recommendation packages can be 
provided to target content and advertisements at the user. 
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My Personalized Auction Site - My History 



- Click here tor My Recommended Page 



Welcome to My Auction Site. We've presented auctions below which you have previously browsed. 
jwyiHistdl 
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My Interest 
Folder 0 

Keywords; 
oak furniture 



Item(s) 



Price 



# (Time Auction 
Bids [ ^Ends ; \ I 



4 solid oak antique chairs 



Oak Hall Seat 




Rare Old Oak Watch Crystal Cabinet 



320.00 
200.00 
95.00 



0 
0 
0 



4 days, 18 
hours + 

6 days, 19 
hours + 

6 days, 20 
hours + 



My Interest 
Folder 1 

Keywords: 
coin roman 
ancient silver 
postage 



Item(s) 



Price 



ft 
Bids 



Time Auction 

-'Ends j 



LRG Roman SILVER Coin-98 AD-Rome's Golden Era 
250 IJNCLEANED ANCIENT ROMAN COINS 



CHEAP!!!! 

ROMAN CQIN-60 AD. Emperor NERO M Famos f !! 



Roman SILVER Coin- 200AD- Fmprcss& Peacock!!! 



24.95 
0.65 

18.95 
9.95 



11 
3 
1 



3 days, 3 
hours + 

4 days, 23 
hours + 

5 days, 3 
hours - 1 

6 days. 20 j 
hours I 
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My Personalized Auction Site -My Recommended Auctions ; [ 
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Welcome to My Auction Site! - now personalized to you. We've presented auctions below which we 
hope you will find relevant to your interests. 
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Folder 0 
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# ( Time Auction 
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GREAT 12 DRAWER OAK FILE CABINET 



Old oak Infant/Toddler Sit-In Rocker 




50.00 0 



5.00 0 



6 days, 17 
hours + 



.6 days, 6 
hours + 



Old Solid Redwood Adirondack Chairs 



50.00 0 6 . days ' 16 
hours + 



Antique chair— take a iooki 



Antique Furniture: Explained and Illustrated 



11.00 1 



5.00 0 



6 days, 16 
hours -r 



6 days, 17 
hours + 



, Item(s) : 



Tnci flT~ 
Bids 



Time Auction 
Ends 



Roman SILVER Coin-240 AD. Military Theme !! 



18.95 4 l d W\ 3 
hours + 



My Interest 
Folder 1 



ROMAN COIN-365 AD 



Roman SILVER Coin-250 AD. ARMY in KOSO VO 



Greek SILVHR Coin-320 BC. Alexander the Orpnt 



9.95 18 



19.95 12 



1 days, 4 
hours + 



2 days, 3 
hours + 



49.95 10 4^.4 

hours -+- 



SH VFR Coin-! st Cent BC. Roma/Chariot !! LOOK 24.95 7 3 hours - 23 
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COMBINED DECLARATION FOR PATENT APPLICATION 
AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names are 
listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

METHOD AND SYSTEM FOR INTERNET PERSONALIZATION 

the specification of which (check one) _x_ is attached hereto or was filed on as Application Serial No. and was 

amended on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the claims, as amended by any 
amendment referred to above. 

I acknowledge the duty to disclose all information which is material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1.56. 

OOOOOOCK 3 0 OOCKXX>OOCNX^ 

I hereby claim foreign priority benefits under Title 35, United States Code, §119 of any foreign application(s) for patent or inventor's 
certificate listed below and have also identified below any foreign application for patent or inventor's certificate having a filing date before that 
of the application on which priority is claimed: 

Prior Foreign Application(s) Priority Claimed 

Yes No 

Number Country Day/Month/Year Filed 

Number Country Day/Month/Y ear Filed 

I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed below and, insofar as the subject 
matter of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first 
paragraph of Title 35, United States Code, §1 12, 1 acknowledge the duty to disclose all information which is material to patentability as 
defined in Title 37, Code of Federal Regulations, Section 1.56 which became available between the filing date of the prior application and the 
national or PCT international filing date of this application: 

Application Serial No. Filing Date Status: Patented, Pending, Abandoned 



Application Serial No. Filing Date Status: Patented, Pending, Abandoned 

OOOOOOOOOOOCK>CKX>OOCKX^^ 

I hereby appoint the members of the firm of DERGOSITS & NOAH LLP, a firm composed of Michael E. Dergosits (Reg. No. 31,243), 
Todd A. Noah (Reg. No. 35,626), Richard A. Nebb (Reg. No. 33,540), Laurie J. Mintz (Reg. No. 35,957), Paul K. Tomita (Reg. No.: 43,196) 
and Geoffrey T. Staniford, (Reg. No. 43,15 1), Four Embarcadero Center, Suite 1150, San Francisco, California 941 1 1 as my attorneys with full 
power of substitution to prosecute this application and to transact all business in the Patent and Trademark Office in connection therewith. 

Please direct all correspondence regarding this application to the following: 

DERGOSITS & NOAH LLP 
Attn: Michael E. Dergosits 
Four Embarcadero Center, Suite 1150 
San Francisco, California 941 1 1 
Telephone: (415) 705-6377 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patent issued thereon. 



Full name of first joint inventor: Jonathan Oliver 

Inventor's signature: 



Date 

Residence: 733 - 65 th Street Oakland, California 94609 

Citizenship: Australian 

Post Office Address: same as above . 



Full name of second joint inventor: Rohan Baxter 

Inventor's signature: 



Residence- 485 Prince Street Oakland, California 94610 

Citizenship: Australian 



Date 



Post Office Address: same as above 



Full name of third joint inventor: Wray Buntine 

Inventor's signature: 



Residence: 1 126 Oxford Street Oakland, California 94707 



Date 



Citizenship : Australian 



Post Office Address: same as above 



Full name of fourth joint inventor: Steven Waterhouse 

Inventor's signature: 



Residence: 1660 Bay Street #203, San Francisco, California 94123 



Date 



Citizenship: U.S.A. 



Post Office Address: United Kingdom 
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